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Abstract 

Scale insects (Hemiptera: Coccoidae) constitute a very diverse group of sap-feeding insects with a large diversity of symbiotic asso- 
ciations with bacteria. Here, we present the complete genome sequence, metabolic reconstruction, and comparative genomics of the 
flavobacterial endosymbiont of the giant scale insect Llaveia axin axin. The gene repertoire of its 309,299 bp genome was similar to 
that of other flavobacterial insect endosymbionts though not syntenic. According to its genetic content, essential amino acid bio- 
synthesis is likely to be the flavobacterial endosymbiont's principal contribution to the symbiotic association with its insect host. We 
also report the presence of a y-proteobacterial symbiont that may be involved in waste nitrogen recycling and also has amino acid 
biosynthetic capabilities that may provide metabolic precursors to the flavobacterial endosymbiont. We propose "Candidatus 
Walczuchella monophlebidarum" as the name of the flavobacterial endosymbiont of insects from the Monophlebidae family. 
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Introduction 

Insects have specialized symbioses with certain bacteria that 
provide diverse advantages to hosts. Generally, endosymbi- 
onts reside inside insect cells, sometimes in unique abdominal 
structures called bacteriomes (Wernegreen and Wheeler 
2009). Endosymbionts have reduced genomes and maintain 
functions that allow their hosts to live on nutrient-deficient 
diets such as plant sap or blood (Moya et al. 2008). 
Endosymbionts from different bacterial phyla have been stud- 
ied from various sap-sucking insects, mainly aphids and insects 
in the suborder Auchenorrhyncha (Moran 2007). 

Flavobacteria have been found as endosymbionts of 
several scale insects (Hemiptera: Coccoidea). Until now 
they have been reported within the families Diaspididae, 
Lecanodiaspididae, Coccidae, Eriococcidae, Pseudococcidae, 
Ortheziidae, Coelostomidiidae, and Monophlebidae (Zchori- 
Fein et al. 2005; Gruwell et al. 2007; Matsuura et al. 2009; 
Gruwell et al. 2010; Dhami et al. 2012; Rosenblueth et al. 
2012). Flavobacteria have been considered to be the primary 



endosymbionts of scale insects (Gruwell et al. 2007; 
Rosenblueth et al. 2012). The genome of Uzinura diaspidicola 
(endosymbiont of Diaspididae) was recently sequenced 
(Sabree et al. 2012). Flavobacteria from scale insects 
seem to have a single origin, except for Brownia rhizoecola 
(from Pseudococcidae: Phenacoccinae) tribe Rhizoecini 
(Rosenblueth et al. 2012). Some endosymbiotic 
Flavobacteria form a clade on the basis of 1 6S rRNA sequences 
and have been proposed to cospeciate with their 
Monophlebidae hosts (Rosenblueth et al. 2012). Besides 
scale insects, two other Flavobacteria have been reported as 
insect endosymbionts, Sulcia muelleri f rom Auchenorrhyncha 
and Blattabacterium spp. from basal termites and cock- 
roaches, and their genomes have been fully sequenced (Wu 
et al. 2006; McCutcheon and Moran 2007; Lopez-Sanchez 
et al. 2009; McCutcheon et al. 2009; Sabree et al. 2009; 
McCutcheon and Moran 2010; Woyke et al. 2010; Neef 
et al. 2011; Huang et al. 2012; Sabree et al. 2012; Patino- 
Navarrete et al. 2013; Tokuda et al. 2013). In scale insects, 
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Enterobacteriaceae endosymbionts have been frequently 
found to coexist with Flavobacteria, including insects from 
Diaspididae (Rosenblueth et al. 2012); although the genome 
of U. diaspidicola has been sequenced from an insect that only 
has a single endosymbiont species (Sabree et al. 201 2). Llaveia 
axin axin (Monophlebidae) harbors both endosymbionts pre- 
viously found in scale insects: a f lavobacterium and an entero- 
bacterium (Rosenblueth et al. 2012). 

Llaveia axin axin (Llave) (called "niij" by native people) is a 
giant scale mainly restricted to tropical lowland regions of the 
states of Michoacan, Guerrero, and Chiapas in Mexico, and in 
Guatemala, although there are reports previous to 1995 of its 
presence in the Mexican states of Guanajuato, Veracruz, and 
Yucatan (Ben-Dov 2005). It is characterized by its use in the 
manufacture of native traditional crafts that provide an eco- 
nomic benefit to the local people. A yellow fat that is obtained 
from the female insect is used to prepare a lacquer to coat 
traditional art crafts making them resistant to heat, water, and 
decay. The lacquer can be mixed with other natural products 
to obtain different colors. It has been used on eating utensils 
without toxic effects and also as a medicine unguent for ex- 
ternal wounds or pain (Williams and MacVean 1 995; Martinez 
2006). 

During its early stages of development, the niij establishes 
on the young leaves and stems of its host plants, specially 
Acacia cochliacantha, A. angustissima (reclassified as 
Acaciella angustissima), Spondias sp., and Jatropha curcas 
(Rincon-Rosales and Gutierrez-Miceli 2008; Suazo-Ortuho 
et al. 2013) where it sucks the plant sap. As the insect 
grows, it keeps moving on the plant until it becomes an 
adult and reaches the principal trunk. Three-year-old plants 
can have around 300-400 insects but only during a short time 
in the year (from July to October). Locals collect the females at 
the end of the rainy season, rendering most of them to obtain 
the lacquer, but preserving a few to obtain eggs (around 500 
eggs per female). Locals place these eggs at the crown of the 
plants when the rainy season begins and the eggs hatch a few 
hours later (MacVean 1999). The populations of L. axin axin 
have declined due to its overexploitation, as well as overgraz- 
ing, forest fires, and deforestation of the host plants. As they 
are considered pests of some commercial crops, such as 
Spondias purpurea, they have been eliminated (Rincon- 
Rosales and Gutierrez-Miceli 2008; Suazo-Ortuho et al. 
2013). Host plant species of L. axin axin have in common 
the production of tannins (Rincon-Rosales and Gutierrez- 
Miceli 2008; Islam et al. 201 1; de Sousa Araujo et al. 2012) 
which could be toxic compounds for insects. Endosymbiotic 
bacteria could have a role in the insect detoxification as in the 
case of pesticide detoxyfication in stinkbugs by their 
Burkholderia symbiont (Werren 2012). More importantly, 
monophlebids feed on plant sap, which is a poor source of 
essential nitrogen compounds (Sandstrom and Moran 1999). 

We present here the complete genome sequence of the L. 
axin axin f lavobacterial endosymbiont, a comparative genomic 



analysis with other f lavobacterial endosymbionts, as well as an 
analysis of its metabolic complementarity with the enterobac- 
terial endosymbiont. 

Materials and Methods 

Nomenclature 

In this article we did not use the word "Candidatus" as part of 
the name for symbiotic bacteria, and we italicized the genus 
and species (e.g., 5. muelleri instead of Candidatus Sulcia 
muelleri). 

DNA Extraction, PCR, and Cloning 

Seven L. axin axin female adults were collected from each of 
the following host plants: Acaciella angustissima (Ejido Flores 
Magon, Mpo. Venustiano Carranza), Jatropha curcas, and 
Spondias purpurea (Chiapa de Corzo City) from the state of 
Chiapas, Mexico. Total DNA from the freshly collected insects 
was extracted as reported earlier (Rosenblueth et al. 2012). 
PCR was performed with bacterial universal 1 6S rRNA primers 
fD1 (5 / -AGAGTTTGATCCTGGCTCAG-3 / ) and rD1 (5'-AAGGA 
GGTGATCCAGCC-30, which amplify products of about 1 ,500 
bases (Weisburg et al. 1991). The PCR products were cloned 
and 60 individual plasmid clones were sequenced by 
Macrogen Inc. (Korea). Sequences were compared with the 
nt database of NCBI using the BlastN algorithm. 

Fluorescent In Situ Hybridization of the Bacteriome 

Fluorescent in situ hybridization (FISH) was performed as 
described by Koga et al. (2009) with some modifications. 
Twenty-day-old freshly collected L. axin axin first instar 
nymphs (fig. 1a) were dehydrated with a 30-100% ethanol 
series, fixed overnight in Carnoy's solution, washed with eth- 
anol, and treated for a few days in 6% hydrogen peroxide in 
80% ethanol. The samples were washed several times with 
absolute ethanol, then with xylene, and embedded in paraffin. 
They were cut into 10-jim sections with a rotary microtome 
and mounted on silane-coated glass slides. Sections were 
dewaxed through several washes with xylene and ethanol. 
Hybridization buffer with 100nM of the probe was added 
to the samples and was incubated at 28 °C overnight in a 
humidified chamber. The oligonucleotide probe used was 
Cy5_DcFlv1 450 (^-CyS-ATACCTCCGACTTCCAGGA-^), 
which targets 16S of Flavobacteria of Drosicha sp. 
(Matsuura et al. 2009) and L axin axin. After washing with 
PBS, the samples were stained with 2 jig/ml of DAPI and they 
were mounted with citifluor antifade solution. In order to 
confirm the specificity of the probe, control experiments 
were performed with no probe, RNAse digestion, and com- 
petitive suppression with excess unlabelled probes. The slides 
were observed under a Zeiss LSM510 META confocal 
microscope. 
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Fig. 1. — Llaveia axin axin insects, (a) Twenty-day-old nymphs on Jatropha curcas plants, (b) adult females on Jatropha curcas, (c) dissection of left side of 
abdomen of adult female in a dorsal plane showing the lobed bacteriome (BAC) and ovaries (OV), gut, and malpighian tubules were removed. 



DNA Preparation and Sequencing 

Bacteriomes were dissected in PBS from two frozen (-80 °C) 
adult females collected in the summer of 2010 in Ejido Flores 
Magon, Mpo. Venustiano Carranza, Chiapas, Mexico (fig. lb 
and c). DNA from the bacteriomes was purified using the 
Qiagen Dneasy Blood and Tissue Kit. Six micrograms of the 
purified DNA was used for lllumina HiSeq 2000 sequencing by 
the Next Generation Sequencing Division, Macrogen Inc. 

For 454 sequencing, 5 jig of DNA were prepared from ho- 
mogenized and filtered (20 and 1 1 jim pore size filters) 
abdomens of ten fresh adult females collected from the 
same site, using the Qiagen Dneasy Blood and Tissue Kit. 
Pyrosequencing was carried out in a Roche GS-FLX machine 
at the Virginia Bioinformatics Institute at Virginia Tech, USA. 

Genome Assembly and Annotation 

The lllumina Genome Analyzer System generated 61 ,593,058 
paired-end reads of 100nt with an insert size of 455 nt. The 
454 run generated 78,087 single-end reads with an average 
length of 190nt. 

Velvet (Zerbino and Birney 2008) was used to make a 
hybrid assembly with the lllumina and 454 reads generating 
118 contigs with an N50 of 231,364. All contigs were com- 
pared with the nt database of NCBI using the BlastN algorithm 
and only two contigs had high-scoring alignments with se- 
quences from Flavobacteria. 

A second assembly was run using Phrap (Gordon et al. 
1998) taking the velvet contigs as input, generating a single 
circular contig of 309,299 bp which corresponded to the chro- 
mosome sequence of the flavobacterial endosymbiont. 



Average coverage per nucleotide was 1571.4x. Protein- 
coding genes were predicted using Glimmer3, 
GeneMark.hmm, and Blast; tRNAs and a tmRNA were iden- 
tified with tRNAscan-SE; and rRNAs were identified using the 
web version of WU-BLAST against the Rfam 1 1 .0 sequence 
library. 

Gene function annotation of the predicted protein-coding 
genes was based on results of BlastP searches against the 
RefSeq database and of hidden Markov model searches of 
the Pfam and TIG RFAM databases. The GenePRIMP pipeline 
(Pati et al. 2010) was used to search for gene call anomalies 
and the resulting report was used to perform manual curation 
of the genome. 

Metabolic Reconstruction 

The flavobacterial endosymbiont metabolic pathways were 
constructed by hand using the Ecocyc and Metacyc databases 
and the KEGG Automatic Annotation Server assignments as 
guides. 

Comparative Genomics 

Average nucleotide identity between the genomes of S. muel- 
leri CARI (from Clastoptera arizonana), Uzinura diaspidicola 
ASNER (from Aspidiotus nerii), Blattabacterium sp. BPLAN 
(from Periplaneta americana), and the flavobacterial endosym- 
biont of L. axin axin was calculated with custom Perl scripts 
using Mummer alignments output. 

Orthologous genes and the core genome of flavobacterial 
endosymbionts were determined based on BlastP matches 
between all genes of the four genomes with a high score 
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>75 using the CoreGenes server (Zafar et al. 2002). To show 
the syntenous blocks of genes between flavobacterial endo- 
symbionts, the genome of the flavobacterial endosymbiont of 
L axin axin was aligned versus the 5. muelleri, U. diaspidicola, 
Blattabacterium sp., and the free-living pathogen 
Flavobacterium psychrophilum genomes using PROmer 
(Kurtz et al. 2004) and plotted in figure 5. 

Phylogenetic Analysis 

Flavobacteria 16S rRNA sequences from endosymbionts 
were aligned with ClustalW (Thompson et al. 1994) and 
manually edited. TIM2 + G model was chosen by 
FindModel (http://www.hiv.lanl.gov/content/sequence/find 
model/findmodel.html; last accessed March 17, 2014. 
Posada and Crandall 2001) using Akaike Information 
Criterion (Akaike 1974). Gaps were not considered and 
443 positions were analyzed. Maximum likelihood analysis 
was performed using PhyML 3.0 (Guindon et al. 2010) with 
1,000 bootstrap replicates. 

Enterobacteria phylogeny was built by aligning the amino 
acid sequences of housekeeping genes {rpoA, rpoB, rpoD, 
rpoH, nusA, nusB, gyrA, pykA, dnaE, and DNA primase) con- 
served in 20 y-proteobacteria species including the enterobac- 
terial endosymbiont of L axin axin and other insect 
endosymbionts with reduced genomes. The alignments 
were concatenated and all positions containing gaps and miss- 
ing data were eliminated, leaving a total of 5,565 positions in 
the final data set. The best model search and phylogenetic 
analysis were conducted in MEGA5 (Tamura et al. 2011). 
The evolutionary history was inferred by using the 
Maximum Likelihood method based on the Whelan and 
Goldman + Freq. model with 1,000 bootstrap replicates. 

GenBank accession numbers from reported sequences 
used to construct the phylogenies are shown in supplemen- 
tary table S1, Supplementary Material online. 

Results 

L. axin axin Has Two Bacterial Endosymbionts in 
Bacteriomes 

Only two phylotypes were detected in all the insect female 
adults sampled, Enterobacteriaceae and Flavobacteria. 
Figure 1c shows one of the two symbiotic organs (bacter- 
iomes) located in the abdominal area of the insect. Figure 2 
shows that Flavobacteria are localized in the bacteriomes and 
that each bacteriome consists of around six lobes. The same 
type of bacteriome has been found in Drosicha spp., which 
also belongs to Monophlebidae family (Matsuura et al. 2009). 
Illumina sequences obtained from bacteriome DNA confirmed 
that the enterobacterial symbiont is also present in the bacter- 
iome. It has also been located in ovary and eggs by PCR. 

The maternally inherited endosymbionts of Monophlebidae 
as well as the two large lobed bacteriomes where they are 



located had been described earlier by Walczuch (1932) and 
Tremblay (1989). Monophlebidae has been considered a 
family of the superfamily Coccoidea in several recent studies 
(Hodgson and Foldi 2006; Gullan and Cook 2007). Tremblay's 
(1989) study of the endosymbionts also led him to treat 
Monophlebidae as a distinct family. 

Proposed Name for Monophlebidae Scale Insects 
Flavobacterial Endosymbionts 

We propose the name "Candidatus Walczuchella mono- 
phlebidarum" for the flavobacterial endosymbiont living 
inside bacteriocytes of insects from the family 
Monophlebidae. The name Walczuchella has been chosen 
to honor Walczuch (1932) who described the morphology 
of the bacteriomes in Monophlebidae. Walczuchella: 
Wal.czuch'el.la. N.L. fern. dim. n. Walczuchella, named after 
Walczuch. Previous phylogenetic studies have shown that all 
the Flavobacteria from Monophlebidae belong to the same 
clade and suggest that they have cospeciated with their insect 
hosts in this family (Rosenblueth et al. 2012). That is why we 
propose the species name to be monophlebidarum: mo.no. 
phle.bi.da'rum N.L. fern. pi. n. Monophlebidae, a zoological 
family name; N.L. gen. pi. n. monophlebidarum, of 
Monophlebidae. Further analyses should be done to deter- 
mine whether other Flavobacteria that have been previously 
obtained from insects of the family Coccidae and 
Lecanodiaspididae whose 16S rRNA sequences are phyloge- 
netically related to Walczuchella monophlebidarum 
(Rosenblueth et al. 2012) could belong to the same species 
(fig. 3a). 

General Genomic Features 

The genome of W. monophlebidarum of L. axin axin consists 
of a circular chromosome of 309,299 bp with an average 
G + C content of 32.6% and a coding density of 86.2%. It 
encodes 33 tRNAs corresponding to 20 aminoacids, a single 
rRNA operon and one tmRNA. There were identified 271 pro- 
tein-coding sequences (CDSs), 8 of which were classified as 
hypothetical proteins and the rest had assigned putative bio- 
logical functions. Twenty-seven CDSs were classified as pseu- 
dogenes because of the presence of frameshifts, early stop 
codons, or because the encoded protein had less than 50% of 
the length of its closest ortholog in the databases. It is impor- 
tant to note that some of the pseudogenes with frameshifts in 
homopolymeric tracts may still conserve functionality (Tamas 
et al. 2008; Wernegreen et al. 2010). 

Metabolic Reconstruction of the Flavobacterial 
Endosymbiont Genome 

In figure 4, a reconstruction of W. monophlebidarum metab- 
olism is shown. Most genes are involved in protein and amino 
acids synthesis and RNA processing. W. monophlebidarum 
has a minimal set of genes for genome replication, 
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Fig. 2. — Localization of Walczuchella monophlebidarum (green) by fluorescent in situ hybridization in 20-day-old nymphs of Llaveia axin axin. (a) Dorsal 
tissue sections of the whole body showing the pair of large lobed bacteriomes where flavobacteria reside, (b) sagittal tissue section of the whole body 
showing one lobed bacteriome, (c) enlarged image of a bacteriome lobe. 



transcription, and translation. The replication related genes 
code for DNA polymerase III subunits (oc/s, p, y/x, 5, and 5') 
and DNA gyrase. For transcription, the RNA polymerase core 
subunits (a, p, and P') are present along with their associated 
g70 and a54 factors. For translation, the complete set of 
ribosomal proteins is retained along with the three ribosomal 
RNAs and translation initiation factors (I, II, and III), elongation 
factors (G, P, Tu, and Ts), and peptide chain release factors (1 
and 2). 

Almost all of the genes necessary to synthesize the ten 
essential amino acids are present; however, some of them 
are annotated as pseudogenes and seven are absent. 
Biosynthetic pathways for methionine, threonine, tryptophan, 
and arginine are complete; however, some genes encoding 
intermediate enzymes in the pathways are annotated as pseu- 
dogenes and might not be functional. Pathways for biosyn- 
thesis of phenylalanine, histidine, lysine, and branch-chained 
amino acids are incomplete. 

The tRNA synthetases for methionine, asparagine, alanine, 
and aspartate are missing. 

A complete gene set encoding the twin-arginine transloca- 
tion system (tatABQ is present. Genes for an additional pro- 
tein translocation system were also found. These correspond 
to the Sec system, which is formed by the secYEG operon that 
encodes a protein conducting channel embedded in the mem- 
brane and secA that encodes an ATPase that drives transloca- 
tion. All genes are present except secG and spread over the 
genome not in operons. The gene encoding the auxiliary pro- 
tein YidC for the biogenesis of both translocation systems is 
present. 

Four subunits of the FOF1 ATP synthase are encoded in the 
genome. Nonetheless atpABG are annotated as pseudogenes 
and only atpF seems to be conserved. However, the 



flavobacterial genome encodes electron transport proteins: a 
NADH dehydrogenase, a cbb3-type cytochrome c oxidase 
complex, and two cytochrome c family proteins, which the 
flavobacterium can use to produce ATP. 

The only genes related to vitamin and cofactor biosynthesis 
in the genome are ispB, menA, and menB from the menaqui- 
none biosynthetic pathway. 

Most of the glycolytic pathway and the TCA cycle are lost 
except for the pathway converting 2-oxoglutarate to 
succinate. 

As in other insect endosymbionts with reduced genomes, 
W. monophlebidarum lacks genes related to cell envelope 
biogenesis (fatty acids, phospholipids, and peptidoglycan), 
DNA recombination, cell motility, defense response, and 
transporters (McCutcheon and Moran 2012). 

The reduced genetic repertoire of the W. monophlebi- 
darum genome suggests that it depends on its host or on 
the secondary symbiont to complement its metabolic and cel- 
lular processes. 

Comparative Genomics between Flavobacterial 
Endosymbionts 

Walczuchella monophlebidarum from L axin axin shares 
85.6% average genomic nucleotide identity with U. diaspidi- 
cola, 86.8% with Blattabacterium sp. BPLAN, and 87.9% with 
5. muelleri CARI, which are the closest bacterial relatives with 
a sequenced genome. Some characteristics of other highly 
reduced genomes from different insect endosymbionts are 
shown in table 1 . 

At the genetic level W. monophlebidarum shares 209 ho- 
mologous genes with U. diaspidicola, 21 5 genes with 5. muel- 
leri and 250 with Blattabacterium sp. There is functional 
conservation between these flavobacterial symbionts but no 
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Table 1 

Characteristics of Highly Reduced Genomes from Different Insect Endosymbionts 

Endosymbiont Class Host Host Diet Genome G + C% CDSs rRNAs tRNAs Pseudo- Coding % 

Size (bp) a genes 



Flavobacterium psychrophilum 


Flavobacteria 


Free livincj 




7 OC1 QQQ 


33 
jj 




1 R 
I £5 






oD.D 


Walczuchella monophlebidarum 


Flavobacteria 


Scale insects 


Phloem sap 


309,299 


32.6 


271 


3 


33 


27 


86.2 


Uzinura diaspidicola 


Flavobacteria 


Scale insects 


Parenchyma 


263,431 


30.2 


227 


3 


30 


7 


89.3 


Blattabacterium sp. MADAR 


Flavobacteria 


Termites 


Wood 


590,336 


28 


550 


3 


34 


9 


94.9 


Blattabacterium sp. BPLAN 


Flavobacteria 


Cockroaches 


Omnivorous 


640,442 


28 


580 


3 


33 


0 


94.6 


Sulcia muelleri CARI 


Flavobacteria 


Spittlebugs 


Sap 


276,511 


21 


246 


3 


29 


0 


93.2 


Buchnera aphidicola APS 


Gammaproteobacteria 


Aphids 


Sap 


655,725 


26 


575 


3 


32 


1 


87.5 


Buchnera aphidicola Cc 


Gammaproteobacteria 


Aphids 


Sap 


422,434 


20.2 


362 


3 


31 


3 


86.9 


Carsonella ruddii PV 


Gammaproteobacteria 


Psyllids 


Phloem sap 


159,662 


17 


182 


3 


28 


0 


97.3 


Moranella endobia 


Gammaproteobacteria 


Mealybugs 


Sap 


538,294 


43.5 


406 


5 


41 


28 


79 


Tremblaya princeps PCIT 


Betaproteobacteria 


Mealybugs 


Sap 


138,927 


59 


121 


6 


12 


2 


72.5 


Zinderia insecticola CARI 


Betaproteobacteria 


Spittlebugs 


Sap 


208,564 


14 


202 


3 


25 


0 


92.4 


Hodgkinia cicadicola Dsem 


Alphaproteobacteria 


Cicadas 


Sap 


143,795 


58 


169 


3 


15 


0 


94 



a Genome size includes chromosome and plasmids if present. 



significant genomic synteny. Nevertheless, it can be observed 
that there is more synteny disruption between 5. muelleri or 
Blattabacterium sp. and W. monophlebidarum than between 
U. diaspidicola and W. monophlebidarum (fig. 5). The closer 
evolutionary relationship between these last two bacteria can 
be appreciated in the 16S rRNA gene phylogeny of different 
endosymbiotic Flavobacteria (fig. 3a) and has been observed 
in previously reported phylogenies (Gruwell et al. 2007; 
Gruwell et al. 2010; Rosenblueth et al. 2012). The core 
genome of the four endosymbionts corresponds to 157 
genes. There are seven genes present in U. diaspidicola, 
S. muelleri, and Blattabacterium sp. but absent in W. mono- 
phlebidarum. These genes encode for DNA mismatch repair 
protein MutL, replicative DNA helicase DnaB, methionyl-tRNA 
synthetase, two enzymes in the phenylalanine biosynthetic 
pathway AroE and AspC, malic enzymes MaeA/B, which cat- 
alyze the decarboxylation of malate to form pyruvate, and 
glyceraldehyde-3-phospate dehydrogenase (GapA) required 
in glycolysis. 

Some genes have been lost or pseudogenized in W. mono- 
phlebidarum but conserved in 5. muelleri and U. diaspidicola, 
even though these latter two species have more reduced ge- 
nomes (table 2). 

Table 3 shows some of the specific differences found in 
flavobacterial endosymbionts. This comparison showed that 
U. diaspidicola lost all the genes that other flavobacterial sym- 
bionts retained related to energy production, menaquinone 
biosynthesis and protein transport. On the other hand, U. 
diaspidicola has two genes related to vitamin metabolism 
not found in other flavobacterial symbionts, folC for folic 
acid metabolism and thiL for thiamine biosynthesis. Only 
Blattabacterium sp. retained the folate biosynthetic pathway. 

In general, the set of amino acid biosynthetic genes of 
W. monophlebidarum and U. diaspidicola is the same, with 



the exception of two genes {aspC and argF) not found in 
W. monophlebidarum. Both W. monophlebidarum and 
U. diaspidicola lost ilvE which encodes the enzyme for the 
last step in the branched-chain amino acid biosynthesis path- 
way while 5. muelleri and Blattabacterium sp. have this gene. 
Similarly, 5. muelleri and Blattabacterium sp. have dapF, a 
gene from the lysine biosynthetic pathway that is absent in 
W. monophlebidarum and U. diaspidicola. 

Walczuchella monophlebidarum, U. diaspidicola, and 
Blattabacterium sp. retained two sigma factors RpoD and 
RpoN which is unusual in insect endosymbiont genomes. 

Only Blattabacterium sp. possesses the ability to synthesize 
the cell envelope components. The glycolytic pathway as well 
as the TCA cycle are essentially absent in W. monophlebi- 
darum, U. diaspidicola, and 5. muelleri. 

Amino Acid Biosynthesis Related Genes in the 
Enterobacterial Endosymbiont 

From the 454 and lllumina sequences, a draft assembly of the 
enterobacterial endosymbiont genome was obtained consist- 
ing of 679 scaffolds with an N50 of 7,713 and an average 
G + C content of 55.6%. Lengths of scaffolds sum 3.4 Mb of 
sequence, and taking that as the genome size, we calculate a 
genome coverage of 34x. We searched for all the essential 
amino acid biosynthesis genes in the enterobacterial endo- 
symbiont sequences and made a reconstruction of its amino 
acid biosynthetic pathways. The enterobacterial symbiont has 
the potential to synthesize the 10 essential amino acids. Only 
the aspC and argC genes were not found in the enterobacte- 
rial scaffolds. Different from W. monophlebidarum, it retains 
the cobalamin-dependent methionine synthase to synthesize 
methionine from homocysteine and vitamin B2. The entero- 
bacterial symbiont also has the capacity to synthesize 
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Fig. 5. — Genomic alignments of Walczuchella monophlebidarum of Llaveia axin axin (y axis) versus (A) Uzinura diaspidicola (B) Sulcia muelleri CARI, (0 
Blattabacterium sp. BPLAN, and (D) Flavobacterium psychrophilum. Forward matches shown in red. Reverse matches shown in blue. 



nonessential amino acids. It can produce tyrosine from chor- 
ismate and homocysteine from homoserine. 

It is also capable of degrading allantoin to urea and then 
hydrolizing it into C0 2 and ammonia, which can then be used 
for glutamine and glutamate production. 

It is important to point out that most of the amino acid 
biosynthesis genes that were absent or pseudogenized in 
W. monophlebidarum genome were present in the entero- 
bacterial symbiont genome. 

The size of the enterobacterial symbiont genome is similar 
to free-living bacteria suggesting a recent change to a 



symbiotic lifestyle, as in its close relative Sodalis glossinidius 
(Toh etal. 2006). 

Discussion 

The genetic content of W. monophlebidarum genome sug- 
gests a role in synthesizing essential amino acids for the host. 
According to its metabolic reconstruction, the flavobacterium 
needs to be provided with some precursors for the amino acid 
biosynthesis like PEP for phenylalanine and tryptophan pro- 
duction, ribulose-5P for histidine synthesis, pyruvate for 
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Table 2 

Genes Retained in the Sulcia muelleri and Uzinura diaspidicola Genomes That Are Absent in the Genome of Walczuchella monophlebidarum of 
Llaveia axin axin 



S. muelleri Genes 


Related Function 


U. diaspidicola 
Genes 


Related Function 


aroE 


Chorismate biosynthesis 


aroE 


Chorismate biosynthesis 


aspC 


Phenylalanine biosynthesis 


aspC 


Phenylalanine biosynthesis 


aspS 


Aspartyl-tRNA biosynthesis 


dnaB 


Replication 


atpCDEH 


ATP synthase su bun its 


eno 


Glycolysis 


dapF 


Lysine biosynthesis pathway 


folC 


Folic acid metabolism 


dnaB 


Replication 


gapA 


Glycolysis 


fabF 


Fatty acids biosynthesis 


glgA 


Glycogen biosynthesis 


fbaA 


Glycolysis 


maeB 


Malate metabolism 


gapA 


Glycolysis 


metG 


Methionyl-tRNA biosynthesis 


HvE 


val, leu, ile biosynthesis 


mutL 


DNA mismatch repair 


maeA 


Malate metabolism 


pgk 


Glycolysis 


marC 


Integral membrane protein 


queA 


RNA modification 


mutL 


DNA mismatch repair 


rpe 


Pentose degradation 


putA 


Transcription regulator 


sodA 


Oxidative stress response 


ubiE 


Menaquinone biosynthesis 


thiL 


Thiamine biosynthesis 


yaeT 


Outer membrane assembly 


tktC 


Pentose catabolism 


primase 


Replication 


tktN 


Pentose catabolism 


M42 family metaloprotease 


Metaloprotease 


trxA 


Electron transfer 


tal 


Transaldolase 


Ribokinase 


Ribose catabolism 


Mrp/Nbp35 ATP-binding protein 


ATP-binding 


CMP_dCMP deaminase 


Deaminase 


Signal peptidase 


Signal peptidase 






Methy transferase 


Methyltransferase 







branchecl-chain amino acid production, and some non-essen- 
tial amino acids for arginine, methionine, lysine, and threonine 
synthesis (fig. 4). 

The enterobacterial symbiont could be supplying most of 
these precursors as it has the potential to make ribulose-5P 
from ribose-1 P and also PEP and pyruvate from glycolysis. It is 
also capable of producing homocysteine from homoserine for 
methionine biosynthesis. 

Interestingly, the enterobacterial symbiont could recycle 
the waste nitrogen from the insect in the form of allantoin 
to provide precursors for amino acid biosynthesis. 
Nitrogen recycling potential has also been reported in 
Blattabacterium sp. but through a different pathway. It is pos- 
sible that W. monophlebidarum could assimilate distinct nitro- 
gen products as it retains RpoN, a nitrogen-related gene 
regulator. 

The metabolic precursor production by the enterobacterial 
symbiont suggests metabolic complementarities between the 
two endosymbionts. However, their genetic content for es- 
sential amino acid biosynthesis overlaps. This might mean that 
1) both endosymbionts supply the insect host with essential 
amino acids or that 2) because of the loss and degradation of 
many essential genes for viability and amino acid biosynthesis 
in W. monophlebidarum, it has become incapable of fulfilling 
its role in the symbiotic relationship with the insect, and the 
enterobacterial symbiont now fulfills the flavobacterium's 
former functions. 



As with Sodalis glossinidius and Wigglesworthia glossinidia 
(Snyder et al. 2010), the primary endosymbiont may have the 
capacity to produce a nutrient that the secondary symbiont 
needs and is not capable of producing, in this case thiamine. 
This could lead to the stable coexistence of both 
endosymbionts. 

Walczuchella monophlebidarum and U. diaspidicola have 
essentially the same genetic potential for amino acid biosyn- 
thesis, the main difference being the presence of frameshifts 
in some genes of these pathways of W. monophlebidarum. 
Sulcia muelleri differs from them because it has a secondary 
symbiont capable of synthesizing methionine and histidine, 
capabilities that have been lost in 5. muelleri. 

Perhaps the more degraded state of metabolic pathways 
of W. monophlebidarum in comparison with those of 
U. diaspidicola, which has a similar host and environment, 
can be explained by the presence of the enterobacterial sym- 
biont in L axin axin, as this reduces the selection forces on the 
flavobacterium genes. On the other hand, it is interesting 
how S. muelleri, with a similar life style in an insect 
host with a similar diet and also accompanied by a secondary 
endosymbiont as in the case of the L. axin axin 
flavobacterium, has maintained amino acid biosynthetic path- 
way's integrity. 

The high number of pseudogenes in the W. monophlebi- 
darum genome compared with other reduced genomes sug- 
gests that it is still under a genomic reduction process. Despite 
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Table 3 



Comparison of Metabolic Components Differentialy Conserved between Flavobacterial Endosymbionts 





Walczuchella 


Uzinura diaspidicola 


Sulcia muelleri 


Blattabacterium 




monophlebidarum 






sp. BPLAN 


Energy production 


ATP synthase 


Few subunits 


Absent 


Present 


Present 


Cytochrome C 


Present 


Absent 


Present 


Present 


Cbb3-type cyt C oxydase 


Present 


Absent 


Present 


Present 


NADH dehydrogenase 


Present 


Absent 


Present 


Present 


Amino acid biosynthesis 


HvE 


Absent 


Absent 


Present 


Present 


dapF 


Absent 


Absent 


Present 


Present 


His biosynthesis 


Present 


Present 


Absent 


Present 


metE 


Present 


Present 


Absent 


Present 


aspC 


Absent 


Present 


Present 


Present 


argF 


Absent 


Present 


Present 


Present 


Vitamins and cofactors 


Menaquinone biosynthesis 


Incomplete 


Absent 


Incomplete 


Incomplete 


folC 


Absent 


Present 


Absent 


Present 


thiL 


Absent 


Present 


Absent 


Absent 


Protein transport 


Sec system 


Present 


Absent 


Present 


Present 


Tat system 


Present 


Absent 


Present 


Present 


Replication and transcription 


dnaE 


Present 


Present 


Present 


Present 


dnaQ 


Present 


Absent 


Present 


Present 


dnaN 


Present 


Absent 


Present 


Present 


dnaX 


Present 


Absent 


Absent 


Present 


rpoD 


Present 


Present 


Present 


Present 


rpoN 


Present 


Present 


Absent 


Present 


Cell envelope 


Fatty acid biosynthesis 


Absent 


Absent 


Only fabF 


Present 


Phospholypids 


Absent 


Absent 


Absent 


Present 


Peptidoglycans 


Absent 


Absent 


Absent 


Present 


Carbohydrate metabolism 


Glycolysis 


Incomplete 


Incomplete 


Incomplete 


Present 


TCA cycle 


Incomplete 


Incomplete 


Incomplete 


Present 



the lack of genomic synteny observed in the flavobacteria of 
sap-feeding insects, they have a similar genome size and a 
significant functional convergence. 

Supplementary Material 

Supplementary table S1 is available at Genome Biology and 
Evolution online (http://www.gbe.oxfordjournals.org/). 
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